Physiologically Motivated Modelling of the Voice Source in Articulatory Analysis/synthesis

نویسنده

Bert Cranen

چکیده

This paper describes the implementation of a new parametric model of the glottal geometry aimed at improving male and female speech synthesis in the framework of articulatory analysis synthesis. The model represents glottal geometry in terms of inlet and outlet area waveforms and is controlled by parameters that are tightly coupled to physiology, such as vocal fold abduction. It is embedded in an articulatory analysis synthesis system (articulatory speech mimic). To introduce naturally occurring details in our synthetic glottal flow waveforms, we modelled two different kinds of leakage: a “ linked leak” and a “ parallel chink” . While the first is basically an incomplete glottal closure, the latter models a second glottal duct that is independent of the membranous (vibrating) part of the glottis. Characteristic for both types of leaks is that they increase dc-flow and source/tract interaction. A linked leak, however, gives rise to a steeper roll-off of the entire glottal flow spectrum, whereas a parallel chink decreases the energy of the lower frequencies more than the higher frequencies. In fact, for a parallel chink, the slope at the higher freqencies is more or less the same as in the no-leakage case.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Physiologically-motivated modeling of the voice source in articulatory analysis/synthesis

متن کامل

Data-driven Voice Sourcewaveform Modelling

This paper presents a data-driven approach to the modelling of voice source waveforms. The voice source is a signal that is estimated by inverse-filtering speech signals with an estimate of the vocal tract filter. It is used in speech analysis, synthesis, recognition and coding to decompose a speech signal into its source and vocal tract filter components. Existing approaches parameterize the v...

متن کامل

Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling

The paper presents a voice source waveform modeling techniques based on principal component analysis (PCA) and Gaussian mixture modeling (GMM). The voice source is obtained by inverse-filteirng speech with the estimated vocal tract filter. This decomposition is useful in speech analysis, synthesis, recognition and coding. Existing models of the voice source signal are based on function-fitting ...

متن کامل

Session 4aSCb: Voice and F0 Across Tasks (Poster Session) 4aSCb7. A perceptually and physiologically motivated voice source model

Many glottal source models have been proposed, but none has been systematically validated perceptually. Our previous work showed that model fitting of the negative peak of the flow derivative is the most important predictor of perceptual similarity to the target voice. In this study, a new voice source model is proposed to capture perceptually-important source shape aspects. This new model, alo...

متن کامل

A perceptually and physiologically motivated voice source model

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Physiologically Motivated Modelling of the Voice Source in Articulatory Analysis/synthesis

نویسنده

چکیده

منابع مشابه

Physiologically-motivated modeling of the voice source in articulatory analysis/synthesis

Data-driven Voice Sourcewaveform Modelling

Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling

Session 4aSCb: Voice and F0 Across Tasks (Poster Session) 4aSCb7. A perceptually and physiologically motivated voice source model

A perceptually and physiologically motivated voice source model

عنوان ژورنال:

اشتراک گذاری